Dimension Reduction of the Explanatory Variables in Multiple Linear Regression
نویسندگان
چکیده
In classical multiple linear regression analysis problems will occur if the regressors are either multicollinear or if the number of regressors is larger than the number of observations. In this note a new method is introduced which constructs orthogonal predictor variables in a way to have a maximal correlation with the dependent variable. The predictor variables are linear combinations of the original regressors. This method allows a major reduction of the number of predictors in the model, compared to other standard methods like principal component regression. Its computation is simple and quite fast. Moreover, it can easily be robustified using a robust regression technique and a robust measure of correlation.
منابع مشابه
Forecast generation model of municipal solid waste using multiple linear regression
The objective of this study was to develop a forecast model to determine the rate of generation of municipal solid waste in the municipalities of the Cuenca del Cañón del Sumidero, Chiapas, Mexico. Multiple linear regression was used with social and demographic explanatory variables. The compiled database consisted of 9 variables with 118 specific data per variable, which were analyzed using a ...
متن کاملPenalized Bregman Divergence Estimation via Coordinate Descent
Variable selection via penalized estimation is appealing for dimension reduction. For penalized linear regression, Efron, et al. (2004) introduced the LARS algorithm. Recently, the coordinate descent (CD) algorithm was developed by Friedman, et al. (2007) for penalized linear regression and penalized logistic regression and was shown to gain computational superiority. This paper explores...
متن کاملSome Tools for Linear Dimension Reduction
Dimension reduction refers to a family of methods commonly used in multivariate statistical analysis. The common objective for all dimension reduction methods is essentially the same: the reducing of the number of variables in the data while still preserving their information content, however it is measured. In linear dimension reduction this is done by replacing the original variables with a l...
متن کاملEVALUATION OF CONCRETE COMPRESSIVE STRENGTH USING ARTIFICIAL NEURAL NETWORK AND MULTIPLE LINEAR REGRESSION MODELS
In the present study, two different data-driven models, artificial neural network (ANN) and multiple linear regression (MLR) models, have been developed to predict the 28 days compressive strength of concrete. Seven different parameters namely 3/4 mm sand, 3/8 mm sand, cement content, gravel, maximums size of aggregate, fineness modulus, and water-cement ratio were considered as input variables...
متن کاملبهکارگیری متغیرهای پنهان در مدل رگرسیون لجستیک برای حذف اثر همخطی چندگانه در تحلیل برخی عوامل مرتبط با سرطان پستان
Background and Objectives: Logistic regression is one of the most widely used generalized linear models for analysis of the relationships between one or more explanatory variables and a categorical response. Strong correlations among explanatory variables (multicollinearity) reduce the efficiency of model to a considerable degree. In this study we used latent variables to reduce the effects of ...
متن کامل